Natural Language Technology in Precision Content Retrieval
نویسندگان
چکیده
This paper describes a new approach to information access that combines techniques from natural language processing and knowledge representation with a new technique for relevance estimation and passage retrieval. Unlike many attempts to combine natural language processing with information retrieval, these results show significant benefit from using linguistic knowledge. Subsumption technology is used to automatically integrate syntactic, semantic, and morphological relationships among concepts that occur in the material, and to organize them into a structured conceptual taxonomy that is efficiently usable by retrieval algorithms and also effective for browsing.
منابع مشابه
Sapere: Improving the Precision of Information Retrieval Systems Using Syntactic Relations
The Problem: Traditional information retrieval systems based on the “bag-of-words” paradigm cannot capture the semantic content of documents. While these systems are relatively robust and have high recall, they suffer from very poor precision. On the other hand, it is impossible with current technology to build a practical information access system that fully analyzes and understands unrestrict...
متن کاملImproving the Precision of Information Retrieval Systems Using Syntactic Relations
The Problem: Traditional information retrieval systems based on the “bag-of-words” paradigm cannot capture the semantic content of documents. While these systems are relatively robust and have high recall, they suffer from very poor precision. On the other hand, it is impossible with current technology to build a practical information access system that fully analyzes and understands unrestrict...
متن کاملExploiting a Large Thesaurus for Information Retrieval
1. Background Accuracy in information retrieval, that is, achieving both high recall and precision, is challenging because the relationship between natural language and semantic conceptual structure is not straightforward. However, effective retrieval requires that the semantic conceptual structure (or content) of both queries and documents be known. Natural language processing is one way to
متن کاملContent Based Radiographic Images Indexing and Retrieval Using Pattern Orientation Histogram
Introduction: Content Based Image Retrieval (CBIR) is a method of image searching and retrieval in a database. In medical applications, CBIR is a tool used by physicians to compare the previous and current medical images associated with patients pathological conditions. As the volume of pictorial information stored in medical image databases is in progress, efficient image indexing and retri...
متن کاملIndexing and search of multimodal information
The Informedia Digital Library Project allows full content indexing and retrieval of text, audio and video material. The integration of speech recognition, image processing, natural language processing and information retrieval overcomes limits in each technology to create a useful system. In order to answer the question how good speech recognition has to be in order to be useful and usable for...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998